The Nist 2004 Spring Rich Transcription Evaluation: Two-axis Merging Strategy in the Context of Multiple Distant Microphone Based Meeting Speaker Segmentation

نویسندگان

  • Corinne Fredouille
  • Daniel Moraru
  • Sylvain Meignier
  • Laurent Besacier
  • Jean-François Bonastre
چکیده

This paper presents the ELISA speaker segmentation approach applied on multiple audio channel meeting recordings in the framework of NIST RT’04s meeting (spring) evaluation campaign. As done for BN data speaker segmentation, the ELISA “meeting” system involves two speaker segmentation systems developed individually by the CLIPS and LIA laboratories. The main originality consists in a “two-axis” merging strategy, proposed to deal with both multiple expert segmentation outputs and multiple microphone segmentation outputs. While expert merging strategy did not really lead to an improvement of the performance, the individual microphone segmentation merging strategy allowed to provide a global segmentation output from several audio channels (microphones) with acceptable performance. The best system obtained 22.6% of diarization error rate during the NIST RT’04s meeting evaluation.

منابع مشابه

The Rich Transcription 2004 Spring Meeting Recognition Evaluation

This paper presents the design and results of the Rich Transcription 2004 Spring Meeting Recognition Evaluation. The evaluation included both Speaker Segmentation (SPKR) and Speech-to-Text Transcription (STT) tasks. Three microphone type conditions were supported: Multiple Distant Microphones (the primary condition of interest), Single Distant Microphone (SDM), and Individual Head Microphones (...

متن کامل

Speaker diarization for meeting room audio

This paper describes a speaker diarization system in 2007 NIST Rich Transcription (RT07) Meeting Recognition Evaluation for the task of Multiple Distant Microphone (MDM) in meeting room scenarios. The system includes three major modules: data preparation, initial speaker clustering and cluster purification/merging. The data preparation consists of the raw data Wiener filtering and beamforming, ...

متن کامل

Using direction of arrival estimate and acoustic feature information in speaker diarization

This paper describes the IR/NTU system submitted for the NIST Rich Transcription 2007 (RT-07) Meeting Recognition evaluation Multiple Distant Microphone (MDM) task. In our implementation, the Direction of Arrival (DOA) information is specifically used to perform speaker turn detection and clustering. Cluster purification is then carried out by performing GMM modeling on acoustic features. Final...

متن کامل

Speaker segmentation and clustering in meetings

This paper describes the issue of automatic speaker segmentation and clustering for natural, multi-speaker meeting conversations. Two systems were developed and evaluated in the NIST RT-04S Meeting Recognition Evaluation, the Multiple Distant Microphone (MDM) system and the Individual Headset Microphone (IHM) system. The MDM system achieved a speaker diarization performance of 28.17%. This syst...

متن کامل

The IBM RT06s Evaluation System for Speech Activity Detection in CHIL Seminars

In this paper, we describe the IBM system submitted to the NIST Rich Transcription Spring 2006 (RT06s) evaluation campaign for automatic speech activity detection (SAD). This SAD system has been developed and evaluated on CHIL lecture meeting data using far-field microphone sensors, namely a single distant microphone (SDM) configuration and a multiple distant microphone (MDM) condition. The IBM...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004